7,289 research outputs found

    Effects of training datasets on both the extreme learning machine and support vector machine for target audience identification on twitter

    Get PDF
    The ability to identify or predict a target audience from the increasingly crowded social space will provide a company some competitive advantage over other companies. In this paper, we analyze various training datasets, which include Twitter contents of an account owner and its list of followers, using features generated in different ways for two machine learning approaches - the Extreme Learning Machine (ELM) and Support Vector Machine (SVM). Various configurations of the ELM and SVM have been evaluated. The results indicate that training datasets using features generated from the owner tweets achieve the best performance, relative to other feature sets. This finding is important and may aid researchers in developing a classifier that is capable of identifying a specific group of target audience members. This will assist the account owner to spend resources more effectively, by sending offers to the right audience, and hence maximize marketing efficiency and improve the return on investment

    Using support vector machine ensembles for target audience classification on Twitter

    Get PDF
    The vast amount and diversity of the content shared on social media can pose a challenge for any business wanting to use it to identify potential customers. In this paper, our aim is to investigate the use of both unsupervised and supervised learning methods for target audience classification on Twitter with minimal annotation efforts. Topic domains were automatically discovered from contents shared by followers of an account owner using Twitter Latent Dirichlet Allocation (LDA). A Support Vector Machine (SVM) ensemble was then trained using contents from different account owners of the various topic domains identified by Twitter LDA. Experimental results show that the methods presented are able to successfully identify a target audience with high accuracy. In addition, we show that using a statistical inference approach such as bootstrapping in over-sampling, instead of using random sampling, to construct training datasets can achieve a better classifier in an SVM ensemble. We conclude that such an ensemble system can take advantage of data diversity, which enables real-world applications for differentiating prospective customers from the general audience, leading to business advantage in the crowded social media space

    Ranking of high-value social audiences on Twitter

    Get PDF
    Even though social media offers plenty of business opportunities, for a company to identify the right audience from the massive amount of social media data is highly challenging given finite resources and marketing budgets. In this paper, we present a ranking mechanism that is capable of identifying the top-k social audience members on Twitter based on an index. Data from three different Twitter business account owners were used in our experiments to validate this ranking mechanism. The results show that the index developed using a combination of semi-supervised and supervised learning methods is indeed generic enough to retrieve relevant audience members from the three different data sets. This approach of combining Fuzzy Match, Twitter Latent Dirichlet Allocation and Support Vector Machine Ensemble is able to leverage on the content of account owners to construct seed words and training data sets with minimal annotation efforts. We conclude that this ranking mechanism has the potential to be adopted in real-world applications for differentiating prospective customers from the general audience and enabling market segmentation for better business decision making

    Identifying the high-value social audience from Twitter through text-mining methods

    Get PDF
    Doing business on social media has become a common practice for many companies these days. While the contents shared on Twitter and Facebook offer plenty of opportunities to uncover business insights, it remains a challenge to sift through the huge amount of social media data and identify the potential social audience who is highly likely to be interested in a particular company. In this paper, we analyze the Twitter content of an account owner and its list of followers through various text mining methods, which include fuzzy keyword matching, statistical topic modeling and machine learning approaches. We use tweets of the account owner to segment the followers and identify a group of high-value social audience members. This enables the account owner to spend resources more effectively by sending offers to the right audience and hence maximize marketing efficiency and improve the return of investment

    Can education improve tax compliance? Evidence from different forms of tax education

    Full text link
    We examine whether tax compliance is improved via different forms of tax education. We argue that different types of tax education have respective impacts on tax compliance. To explore this empirical issue, we conduct a survey related to tax compliance among 205 students taking either a general tax course or a technical tax course in Hong Kong. Our findings suggest that sales tax compliance among undergraduate students was significantly improved if they had been exposed to a general tax education, and income and sales tax compliance among postgraduate students were significantly improved if they had taken a technical tax course

    Longitudinal effects of metabolic syndrome on Alzheimer and vascular related brain pathology.

    Get PDF
    Background/aimsThis study examines the longitudinal effect of metabolic syndrome (MetS) on brain-aging indices among cognitively normal (CN) and amnestic mild cognitive impairment (aMCI) groups [single-domain aMCI (saMCI) and multiple-domain aMCI (maMCI)].MethodsThe study population included 739 participants (CN = 226, saMCI = 275, and maMCI = 238) from the Alzheimer's Disease Neuroimaging Initiative, a clinic-based, multi-center prospective cohort. Confirmatory factor analysis was employed to determine a MetS latent composite score using baseline data of vascular risk factors. We examined the changes of two Alzheimer's disease (AD) biomarkers, namely [(18)F]fluorodeoxyglucose (FDG)-positron emission tomography (PET) regions of interest and medial temporal lobe volume over 5 years. A cerebrovascular aging index, cerebral white matter (cWM) volume, was examined as a comparison.ResultsThe vascular risk was similar in all groups. Applying generalized estimating equation modeling, all brain-aging indices declined significantly over time. Higher MetS scores were associated with a faster decline of cWM in the CN and maMCI groups but with a slower decrement of regional glucose metabolism in FDG-PET in the saMCI and maMCI groups.ConclusionAt the very early stage of cognitive decline, the vascular burden such as MetS may be in parallel with or independent of AD pathology in contributing to cognitive impairment in terms of accelerating the disclosure of AD pathology

    The Role of Linguistics Habits and Proprioception Sense on Mental Number Line

    Get PDF
    AbstractThe present study investigated the relationship between linguistic styles and numerical representation as evidenced by the Spatial Numerical Association of Response Codes effect, which smaller (larger) number is respond faster with left (right) hand. In Study 1, sixteen participants did numerical judgments by pressing left-or-right side keyboard button with digits presented randomly on three sides of the screen. SNARC effect was found to be significant whereas no significant main effect of number placement was found. In Study 2, fifteen participants reacted with moving the joystick to leftward or rightward with one hand only. No SNARC effect was found as well as the interaction effect. These results suggested the role of proprioception sense in the conceptualization of mental number line

    Integration of Genome-Wide Computation DRE Search, AhR ChIP-chip and Gene Expression Analyses of TCDD-Elicited Responses in the Mouse Liver

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor (TF) that mediates responses to 2,3,7,8-tetrachlorodibenzo-<it>p</it>-dioxin (TCDD). Integration of TCDD-induced genome-wide AhR enrichment, differential gene expression and computational dioxin response element (DRE) analyses further elucidate the hepatic AhR regulatory network.</p> <p>Results</p> <p>Global ChIP-chip and gene expression analyses were performed on hepatic tissue from immature ovariectomized mice orally gavaged with 30 μg/kg TCDD. ChIP-chip analysis identified 14,446 and 974 AhR enriched regions (1% false discovery rate) at 2 and 24 hrs, respectively. Enrichment density was greatest in the proximal promoter, and more specifically, within ± 1.5 kb of a transcriptional start site (TSS). AhR enrichment also occurred distal to a TSS (e.g. intergenic DNA and 3' UTR), extending the potential gene expression regulatory roles of the AhR. Although TF binding site analyses identified over-represented DRE sequences within enriched regions, approximately 50% of all AhR enriched regions lacked a DRE core (5'-GCGTG-3'). Microarray analysis identified 1,896 number of TCDD-responsive genes (|fold change| ≥ 1.5, P1(t) > 0.999). Integrating this gene expression data with our ChIP-chip and DRE analyses only identified 625 differentially expressed genes that involved an AhR interaction at a DRE. Functional annotation analysis of differentially regulated genes associated with AhR enrichment identified overrepresented processes related to fatty acid and lipid metabolism and transport, and xenobiotic metabolism, which are consistent with TCDD-elicited steatosis in the mouse liver.</p> <p>Conclusions</p> <p>Details of the AhR regulatory network have been expanded to include AhR-DNA interactions within intragenic and intergenic genomic regions. Moreover, the AhR can interact with DNA independent of a DRE core suggesting there are alternative mechanisms of AhR-mediated gene regulation.</p

    InnateDB: systems biology of innate immunity and beyond—recent updates and continuing curation

    Get PDF
    peer-reviewedInnateDB (http://www.innatedb.com) is an integrated analysis platform that has been specifically designed to facilitate systems-level analyses of mammalian innate immunity networks, pathways and genes. In this article, we provide details of recent updates and improvements to the database. InnateDB now contains >196 000 human, mouse and bovine experimentally validated molecular interactions and 3000 pathway annotations of relevance to all mammalian cellular systems (i.e. not just immune relevant pathways and interactions). In addition, the InnateDB team has, to date, manually curated in excess of 18 000 molecular interactions of relevance to innate immunity, providing unprecedented insight into innate immunity networks, pathways and their component molecules. More recently, InnateDB has also initiated the curation of allergy- and asthma-related interactions. Furthermore, we report a range of improvements to our integrated bioinformatics solutions including web service access to InnateDB interaction data using Proteomics Standards Initiative Common Query Interface, enhanced Gene Ontology analysis for innate immunity, and the availability of new network visualizations tools. Finally, the recent integration of bovine data makes InnateDB the first integrated network analysis platform for this agriculturally important model organism.This work was supported by Genome BC through the Pathogenomics of Innate Immunity (PI2) project and by the Foundation for the National Institutes of Health and the Canadian Institutes of Health Research under the Grand Challenges in Global Health Research Initiative [Grand Challenges ID: 419]. Further funding was also provided by AllerGen grants 12ASI1 and 12B&B2. D.J.L. was funded in part during this project by a postdoctoral trainee award from the Michael Smith Foundation for Health Research (MSFHR). F.S.L.B. is a MSFHR Senior Scholar and R.E.W.H. holds a Canada Research Chair (CRC). Funding to enable bovine systems biology in InnateDB is provided by Teagasc [RMIS6018] and the Teagasc Walsh Fellowship scheme. IMEx is funded by the European Commission under the PSIMEx project [contract number FP7-HEALTH-2007-223411]. Funding for open access charge: Teagasc [RMIS6018]
    corecore